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Abstract 

In wireless channels, the path loss exponent (PLE) has a strong impact on the quality of links, and hence, 
it needs to be accurately estimated for the efficient design and operation of wireless networks. In this paper, we 
address the problem of PLE estimation in large wireless networks, which is relevant to several important issues in 
networked communications such as localization, energy-efficient routing, and channel access. We consider a large 
ad hoc network where nodes are distributed as a homogeneous Poisson point process on the plane and the channels 
are subject to Nakagami-?n fading. We propose and discuss three distributed algorithms for estimating the PLE 
under these settings which explicitly take into account the interference in the network. In addition, we provide 
simulation results to demonstrate the performance of the algorithms and quantify the estimation errors. We also 
describe how to estimate the PLE accurately even in networks with spatially varying PLEs and more general node 
distributions. 



I. Introduction 

The wireless channel presents a formidable challenge as a medium for reliable high-rate communication. 
It is responsible not only for the attenuation of the propagated signal but also causes unpredictable spatial 
and temporal variations in this loss due to user movement and changes in the environment. In order 
to capture all these effects, the path loss for RF signals is commonly represented as the product of a 
deterministic distance component (large-scale path loss) and a randomly-varying component (small-scale 
fading) [1]. The large-scale path loss model assumes that the received signal strength falls off with distance 
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according to a power law, at a rate termed the path loss exponent (PLE). Fading describes the deviations 
of the received signal strength from the power-law decay due to shadowing and the constructive and 
destructive addition of its multipath components. While the small-scale fading behavior of the wireless 
channel can be well-represented using stochastic processed [2], it is critical to accurately estimate the 
PLE for the efficient design and operation of wireless networks. 

This estimation problem is non-trivial even for a single link due to the existence of multipath propagation 
and thermal noise. For large wireless networks without infrastructure, the problem is further complicated 
due to the following reasons: First, the achievable performance of a typical ad hoc or sensor network 
is not only susceptible to noise and fading, but also to interference due to the presence of simultaneous 
transmitters. Dealing with fading and interference simultaneously is a major difficulty in the estimation 
problem. Second, the distances between nodes themselves are subject to uncertainty. Often, the distribution 
of the underlying point process can be statistically determined, but precise locations of the nodes are harder 
to measure. In such cases, we will need to consider the fading and distance ambiguities jointly, i.e., define 
a spatial point process that incorporates both. 

In this paper, we present three distributed algorithms to accurately estimate the channel's PLE for 
large wireless networks with uniformly randomly distributed nodes in the presence of fading, noise and 
interference, based solely on received signal strength measurements. We also provide simulation results to 
illustrate the performance of the methods and study the estimation error. Additionally, we briefly describe 
how to accurately estimate the PLE in environments with spatially varying PLE values and for more general 
node distributions. The remainder of the paper is structured as follows. Section [II] provides a few examples 
to motivate the importance of knowing the PLE for the analysis and design of communication systems 
and discusses the prior work on the estimation problem. Section [HI] presents the system and channel 
models. Section [IV] describes three distributed algorithms for PLE estimation, each based on a specific 
network characteristic, and provides the mean squared error (MSE) performance of the algorithms based 
on simulation. Section [V] suggests two simple ways to improve the accuracy of the estimation algorithms. 
Section [VI] briefly discusses the sensitivity of the algorithms to variations in the system model, and Section 
IVIII concludes the paper. 

'While modeling wireless channels, the small-scale fading amplitude is often assumed to be distributed as a Rayleigh, Rician or a 
Nakagami-m random variable. 
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II. Motivation and Related Work 

A. Motivation 

In this section, we illustrate the importance of knowing the PLE for efficient design and operation of 
wireless networks. Though it is often assumed in analysis and design problems that the value of the PLE 
is known a priori, this is not true in practice, and hence, the PLE needs to be accurately estimated during 
the network initialization phase. 

The PLE estimation problem is closely related to that of localization, which is an integral component of 
network self-configuration. When bestowed with the ability to detect their positions, sensor nodes deployed 
in an ad hoc fashion can support a rich set of geographically aware protocols and accurately report the 
regions of detected events. Detailed knowledge of the nodes' locations is also needed for performing 
energy-efficient routing of packets in the network. An important class of localization algorithms is based 
on received signal strength (RSS) measurements [3], [4], but it needs accurate estimates of the PLE 
to perform well. Another fundamental issue in sensor networks is the sensing problem that deals with 
how well a target area or a phenomenon can be monitored. Studying network characteristics such as 
connectivity is important for such applications and requires accurate estimates of the PLE [5], [6]. 

A good knowledge of the PLE is also essential for designing efficient networks. In [7], the authors 
discuss capacity results for TDMA-based linear networks and show that the optimum number of hops to 
achieve a desired end-to-end rate strongly depends on the PLE. For example, when the desired (bandwidth- 
normalized) spectral efficiency exceeds the PLE, single-hop transmission outperforms multihopping. Many 
of the existing results on capacity scaling for large ad hoc networks strongly depend on the PLE as 
well. With 7 being the PLE, the best known achievability result [8] states that for a network having n 
uniformly randomly located nodes on the plane, the capacity scales as n 2 " 7 / 2 for 2 < 7 < 3 and as ^/n 
for 7 > 3. Depending on the value of the PLE, appropriate routing strategies (nearest-neighbor hopping 
or hierarchical cooperative schemes) may be implemented to reach the maximally achievable scaling of 
the system throughput. 

Energy consumption in wireless networks is a crucial issue that needs to be addressed at all the layers of 
the communication system. In [9], the author analyzes the energy consumed for several routing strategies 
that employ hops of different lengths in a large network with uniformly randomly distributed nodes. Using 
the results therein, we demonstrate that a good knowledge of the PLE is necessary for efficient routing. 
Consider the following two simple schemes where communication is assumed to occur only in a sector 
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cf> around the source-destination axis. 

1) Route across n nearest neighbor hops in a sector </>, i.e., the next-hop node is the nearest neighbor 
that lies within ±0/2 of the axis to the destination. 

2) Transmit directly to the n'th nearest neighbor in the sector </>. Here, n' is chosen in a way that the 
expected progress is the same for both schemes. 

From [9], the ratio of the consumed energies for the two schemes is obtained as 

E l _ n 2 T(l +7/2)r(n') 
E~ 2 ~ fV + 7 /2) ' 

where T(.) represents the gamma function and n' = j(n 2 — 1) + 1. Observe that the PLE plays an important 
role in determining energy-efficient routing strategies. In particular, when 7 is small, scheme 2 consumes 
less energy while relaying is more beneficial at high PLE values. 

The performance of contention-based systems such as slotted ALOHA is very sensitive to the contention 
probability p, hence it is critical to choose the optimal operating point of the system. The value of the 
contention parameter is determined based on various motives such as maximizing the network throughput 
[10] or optimizing the spatial density of progress [11, Eqn. 5.6]. These quantities also greatly depend on 
the PLE, and therefore the optimal value of the contention probability can be chosen only after estimating 

7- 

B. Review of Literature 

In this section, we survey some of the existing PLE estimation methods in the literature. Most authors 
have assumed a simplified channel model consisting only of a large-scale path loss component and a 
shadowing counterpart, but we are not aware of any prior work that has considered fading, and, most 
importantly, interference in the system model. Therefore, much of the past work on PLE prediction has 
focused mainly on RSS-based localization techniques. However, ignoring interference in the system model 
is not realistic, in particular since PLE estimation needs to be performed before the network is organized. 

Estimation based on a known internode distance probability distribution is discussed in [12]. The 
authors assume that the distance distribution pr(t) between neighboring nodes is known or can be 
determined easily. With the transmit power equal to P [dBm] (assume this is a constant for all nodes), 
the theoretic mean RSS averaged over neighboring node pairs (in the absence of interference) is P = 
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P Er [-R -7 ]- E.g., if the internodal distance distribution is (i.i.d. ) Rayleigrfl with mean R , then we have 
P = P (2i? / v /7r)~ 7 T(l — 7/2). The value of 7 is estimated by equating P to the empirical mean value 
of the RSS measured over several node pairs. 

If the nearest-neighbor distribution is in a complicated form that is not integrable, an idea similar to the 
quantile-quantile plot can be used [12]. For cases where it might not be possible to obtain the neighbor 
distance distribution, the idea of estimating 7 using the concept of the Cayley-Menger determinant or the 
pattern matching technique [12] is useful. 

In [13], the authors consider a network where the path loss between a few low-cost sensors is measured 
and stored for future use. They then propose an algorithm that employs interpolation techniques to estimate 
the path loss between a sensor and any arbitrary point in the network. In [14], a PLE estimator based on the 
method of least squares is discussed and used in the design of an efficient handover algorithm. However, 
as described earlier, the situation is completely different when interference and fading are considered and 
these purely RSS-based estimators cannot be used. Also, none of the prior estimation algorithms works 
in a fully distributed nature. 

III. System Model 

We consider an infinite planar ad hoc network, where nodes are distributed as a homogeneous Poisson 
point process (PPP) $ of density A. Therefore, the number of points lying in a Borel set B, denoted by 
$(5), is Poisson-distributed with mean \v 2 (B), where u 2 (-) is the two-dimensional Lebesgue measure 
(area). Also, the number of points in disjoint sets are independent random variables. The PPP model for 
the node distribution is ubiquitously used and may be justified by claiming that sensor nodes are dropped 
from an aircraft in large numbers; for mobile ad hoc networks, it may be argued that terminals move 
independently of each other. 

The attenuation in the channel is modeled as the product of the large-scale path loss with exponent 7 
and a flat block- fading component. Also, the noise is taken to be AWGN with mean power N . To obtain 
concrete results, the fading amplitude H is taken to be Nakagami-m distributed. Letting m = 1 results in 
the well-known case of Rayleigh fading, while lower and higher values of m signify stronger and weaker 
fading scenarios respectively. The case of no fading is modeled by setting m — > 00. When dealing with 

2 When nodes are arranged uniformly randomly on the plane, the nearest-neighbor distance is Rayleigh distributed [9]. 



signal powers, we use the power fading variable denoted by G = H 2 . The pdf of G is given by [15] 

Pg(%) = tt; — r-x m_1 exp(— mx), m > 1/2, (1) 
r(m) 



and its moments are 



E G [G n } = T{m + n l neR + . (2) 
L J m n Y (m) 



Note that G captures the random deviation from the large-scale path loss, thus Eg[G] = 1. 

Since the PLE estimation is usually performed during network initialization, it is reasonable to assume 
that the transmissions in the system during this phase are uncoordinated. Therefore, we take the channel 
access scheme to be ALOHA. We shall see in Section |V] that even if other MAC schemes were available, 
ALOHA is a good choice of MAC protocol since employing it minimizes the spatio-temporal correlations 
in the interference, which helps improve the estimation accuracy. We denote the ALOHA contention 
probability by a constant p. Therefore, nodes independently decide to transmit with probability p or 
remain idle with probability 1 — p in any time sloo Consequently, the set of transmitters at any given 
moment forms a PPP $' of density Xp. Also, since there is no information available for power control, 
we assume that all the transmit powers are equal to unity. Then, the interference at node y is given by 

U{y) = Y,G Z y\\z-y\\-\ 
ze<E>' 

where G zy is the fading gain of the channel from z to y, and || ■ || denotes the Euclidean distance. 

We define the communication from the transmitter at x to the receiver at y to be successful if and only 
if the signal-to-noise and interference ratio (SINR) at y is larger than a threshold 0, which depends on 
the modulation and coding scheme and the receiver structure. Mathematically speaking, an outage at a 
receiver at y occurs if and only if 

G xy \\x-y\r < 0; (3) 



No + h'\{ X }(y) 

where I&\{ x }(y) denotes the interference in the network at y and x is the desired transmitter. 

IV. Path Loss Exponent Estimation 

This section describes three fully distributed algorithms for PLE estimation, each based on a certain 
network characteristic, and provides simulation results on the estimation errors. The first algorithm uses 

3 The beginning and ending times of a slot is based on the individual node's clock cycle. Thus, time slots across different nodes need not 
(and in general, will not) be synchronized. We will only assume that the duration of the slots are the same. 



the mean interference value and assumes the network density A to be known. Algorithms 2 and 3 are 
based on outage probabilities and the network's connectivity properties, respectively, and do not require 
knowledge of A or the Nakagami parameter m. 

The PLE estimation problem is essentially tackled by equating the empirical (observed) values of the 
aforementioned network characteristics to the theoretically established ones to obtain the estimate 7. In 
each time slot, nodes either transmit (w.p. p) or listen to record measurements (w.p. 1 —p). After obtaining 
the required measurement values over several time slots, the estimation process can be performed at each 
node in a distributed fashion. 

The simulation results provided in this paper are obtained using MATLAB. To analyze the mean error 
performance of the algorithms, we use estimates resulting from 50000 different realizations of the PPP. 
Each PPP is generated by distributing a Poisson number of points uniformly randomly in a 50 x 50 square 
with density 1. To avoid border effects, we use the measurements recorded at the node lying closest to the 
center of the network. The accuracy of the algorithms is characterized using the relative MSE, defined as 
E [(7 — 7) 2 ] /j. The contention probability is taken to be p = 0.05 in each caseQ and iVo = —25 dBm. 

A. Algorithm 1: Estimation Using the Mean Interference 

In many situations, the network density is a design parameter and hence known. In other cases, it is 
possible to estimate the density (see [16, Sec. 2.7] and the references therein for a survey of the estimation 
methods for a PPP). This section discusses a PLE estimation algorithm that uses the knowledge of the 
density A. 

A simple technique to infer the PLE 7 when the Nakagami parameter m is unknown is based on the 
mean interference. According to this method, nodes simply need to record the strength of the received 
power that they observe and use it to estimate 7. We first state existing theoretic results and subsequently 
describe how the estimation can be performed in practice. 

For the PPP network running the slotted ALOHA protocol, the n th cumulant of the interference resulting 
from transmitters in an annulus of inner radius A and outer radius B around the receiver node is given 
by [17] 

p>2—ii'y /\2—n / y 

C n = 2ir\pE G [G n } . (4) 

2 — n7 

4 This value of p was found to be suitable to obtain several quasi-different realizations of the PPP $' and helped obtain accurate estimates 
in a reasonable number of time slots. 



In particular, we can consider only the case 7 > 2 (a fair assumption in a wireless scenario) and let B 
be large (considering the entire network) so that the mean interference is 

[i I = Ci = 2vrAp -, (5) 
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which is independent of m. Consequently, the mean received power is jiR — Hi + N . Note from © that 
the mean received power is infinite for A = 0. However, the large-scale path loss model is valid only 
in the far- field of the antenna and breaks down for very small distances. Denote the (known) near- field 
radius by a positive constant A . 

The algorithm based on the interference moments matches the observed and theoretic values of the 
mean received power and is described as follows. 

• Record the strengths of the received powers R 1} . . . , R N at any arbitrarily chosen node during N 
time slots and evaluate the empirical mean received power (l/N) Y^iLi ^U- 

• Equate the empirical mean to the theoretical value of fiR (with A = A ) and estimate 7 using a 
look-up table and the known values of p, N and (known or estimated density) A. 

Fig. [T] depicts the relative MSE values of the estimated PLE 7 for different 7 and N values. The estimates 
are seen to be accurate over a wide range of PLE values, in particular when the PLE is small. Furthermore, 
the MSE is seen to converge within just about 1000 time slots, i.e., in a few seconds or less, depending 
on the hardware used. 

The estimate 7 can be used along with A to also estimate the Nakagami parameter m. Indeed, from 
©, the variance of the interference is 

/ 1 \ ,42-27 

a] = C 2 = 27r\p 1 + - (6) 

\ m J 27 — 2 

Since noise and interference are independent of each other, the variance of the received power is a R = 
a] + a%, where a 2 N denotes the (known) variance of the noise power. Therefore, an estimate of the fading 
parameter m is obtained by inverting © as 

-1 
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B. Algorithm 2: Estimation Based on Virtual Outage Probabilities 

We now describe an estimation method based on outage probabilities that requires the knowledge of 
neither the network density nor the Nakagami fading parameter. We first review some theoretical results 



and then present a practical scheme to estimate 7. 

In [11], it is shown that when the signal power is exponentially distributed, the success probability 
p s across any link in the Poisson network is equal to the product of the Laplace transforms of noise 
and interference. For the rest of this subsection, we assume that the system is interference-limited, i.e., 
N <C I. In particular, when the transceiver pair separation is unit>f[ we can express the probability of a 
successful transmission [17] as 

p s « exp(- Cl e 2 /T), (8) 

where 

c, = ApirE[G J/7 ]r (1 - 2 h ) = W( m + 2/ 7) r (1 -2/o0 

1 (m)m 2/7 

To estimate 7, the nodes are required to measure the SIR values during several time slots and use it 
to compute the empirical success probability, which matches the theoretical value ©. However, it is 
impractical to place transmitters for each receiver node where a SIR measurement is taken. Instead, nodes 
can simple measure the received powers, and compute the (virtual) SIRs taking the signal powers to be 
independent realizations of an exponential random variable. This algorithm is implemented at each node 
as follows. 

. Record the values of the received powers R±, . . .,Rn at the node during N time slots. Take the 
signal powers Si, 1 < i < N, to be N independent realizations of an exponential random variable 
with unit mean. Using the values Si/ R4, Vi, a histogram of the observed SIR values is obtained. 

• Evaluate the empirical success probabilities at two different thresholds, i.e., compute p s j = (1 /N) Y2i=i ^-{Si/Ri 
3 = 1,2. 

• Match the empirically observed values with the theoretical values: From ©, we obtain ln(p Si i) / ln(p s 2 ) = 
(61/62) • Solving for 7 yields the estimate 

= 2 M e 1 /e 2 ) 

ln(ln(p S) i)/ln(p Si2 ))' 

which is independent of both A and m. 
Fig. [2] plots the relative MSE of 7 for 61 = 10 dB and 62 = dB for different 7 and iV values. We see 
that the error is small when the PLE is small, but increases at larger values of the PLE. 

5 When the transmitter node is unit distance away from the receiver node, the PLE will not affect the received power strength. This case 
is particularly helpful for the implementation of this PLE estimation algorithm. 
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Fig. [3] plots the relative MSE of 7 versus m for various PLE values computed at the end of N = 10000 
time slots. Note that the algorithm performs more accurately at lower values of m. We provide an intuitive 
explanation for this behavior in Section IIV-DI 

We have seen that 7 can be estimated by measuring the outages at two specific values of the threshold. 
We now describe how to improve the estimation accuracy, at the cost of additional complexity. The idea 
is to find the best fit of the empirical distribution of the SIR to the theoretical value given by ([8]). Denote 
the observed values of the SIRs during N time slots by j3i, . . . , (5^. Let the empirical complementary cdf 
of the SIR at Pi be F{(3i). We use the Kolmogorov-Smirnov statistic [18] to define the goodness-of-fit. 
Accordingly, the estimate of the PLE is given by 



The distribution (or curve) fitting method works effectively because of the difference in the behavior of 
the outage probability with respect to the parameters A, m and 7. Specifically, consider the exponent in 
©, and let /(O) = C16 2 / 7 . In the plot of /(O) versus 6, a change in the value of A or m only scales 
the function /, while changing 7 skews it. Thus, the value of 7 that fits the distribution can be efficiently 
estimated by fitting the curve, even when A and m are unknown. 

Fig. |4] plots the relative MSE of 7 versus the number of time slots, for 7 = 3,4. To find the solution 
to (flOl) . we used the in-built function 'fminsearch' in MATLAB. The dashed lines indicate the MSE for 
the case where the estimation is performed using only two specific values of the threshold (as in ©) and 
depict the improvement in performance obtained by employing distribution fitting. Evidently, when A or 
m (or both) is known, the estimation accuracy can be improved further. 

C. Algorithm 3: Estimation Based on the Cardinality of the Transmitting Set 

Without the knowledge of the network density A or the Nakagami parameter m, the PLE can be 
accurately estimated also based on the connectivity properties of the network. In this subsection, we 
derive the average number of nodes that are connected to an arbitrary node in the network and describe 
a PLE estimation algorithm based on our analysis. 

For any node y, define its transmitting set T(y) as the group of transmitting nodes whom it receives a 
(correctly decodable) packet from, in a given time slot. More formally, for receiver y, transmitter node x 
is in its transmitting set if they are connected, i.e., the SINR at y is greater than a certain threshold 0. 
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argmm max 

7 l<i<iV 




(10) 



1 1 



Note that this set changes from time slot to time slot. Also note that for = dB, the condition for a 
transceiver pair to be connected is that the received signal strength is greater than the interference power. 
Thus, for > 1, the cardinality of the transmitting set, \T(y)\, can at most be one, and that transmitter is 
the one with the best channel to the receiver. The estimation algorithm is based on matching the theoretical 
and empirical values of the mean number of elements in the transmitting set. The following proposition 
forms the basis of this estimation scheme. 

Proposition 4.1: Under the conditions of m e N and iV <C /, the mean cardinality of the transmitting 
set of any arbitrary node in the network, Nt, is proportional to 0~ 2 / 7 . 

Proof: For iV <C J, the success probability for a transceiver pair at an arbitrary distance R units 
apart can be expressed as 

p s (R) = E/ [Pr (GRT 1 > 10 | I 



f°° m m 

I — — -a: m_1 exp(— mx)dx 
J iew r (m) 



oo 



(*) / T(m,xeR^m) Pl (x)dx, (11) 



T ( m ) Jo 

where (a) is obtained using ([I]) and (b) using the definition of the upper incomplete gamma function^ 
r(-, •). Here, pi(x) denotes the pdf of the interference. 

The expressions can be further simplified when m is an integer. For m e N, we have 

m— 1 - 

Ps(R) W (xBR' r m) k exp(-xeR~ f m)p I (x)dx 
k=o ' ^° 

@ E" W J - 1 ^M I {a)\^Rr ni (12) 

fc=0 

where Mj(s) is the moment generating function (MGF) of /. Here, (a) is obtained from the series 
expansion of the upper incomplete gamma function and (b) using the definition of the MGF. When the 
node distribution is Poisson, we have the following closed-form expression for the MGF [17, Eqn. 20]: 

M/(s) = exp(-Ap7rE G [G 2 / 7 ]r(l - 2/ 7 )s 2 / 7 ), for 7 > 2. 
Using this, we get 

p s (R)= exp (-c 2 R 2 )J2 [ - 1 ¥ ^[-) , rneN (13) 

fc=0 ' 



where c 2 = Ap7rE G (G 2 / 7 )r(l - 2/ 7 ) (0m) 2 / 7 = Cl (Qm] 



2/7 



6 Mathematica: Gamma[a,z] 



12 



Now, we consider an arbitrary receiver node y, shift it to the origin and analyze the transmitting set 
for this "typical" node. Consider a disk of radius a centered at the origin. Let E denote the event that 
an arbitrarily chosen transmitter inside this disk is in y's transmitting set. Since the nodes in the disk are 
uniformly randomly distributed, we have 

Pt(E) = E R [ Ps (R) | R] 

™ 2 Jo t^o k] \ 7 / 



« 2 fo V 7 J Jo k\ 



(a) 



1 m_1 /2\ fe 1 

i^E -J n (r(* + i)-r(* + W)) 



a _ 

fc=0 



where (a) is obtained by a simple change of variables (c 2 r 2 = t) and (b) using the definition of the upper 
incomplete gamma function. 

Denote the mean number of transmitters in the disk of radius aby N a = Xpwa 2 . Then, we can write 

N T = E\T(y)\ = lim N a Pi(E) 



Xpir ^-4 / 2 \ k Xpn ^ 



in 



C 2 V7/ 7 C 2 1-| 



2 



fc=0 

r(m) 

6) i V7/ 7 — (14) 

= r(m + |)r(2-|)eV7- 1 } 

Here, (a) is obtained using the fact that lirn^oo T(a, z) = and (6) using the definition of c 2 and ©. ■ 
The analytical value of the mean cardinality of the transmitting set when m £ N can be evaluated from 
(fl~4l) . It is plotted in Fig. [5] for two different thresholds and 7 = {2.5,3,3.5,4,4,5}. Interestingly, since 
T(2 + 2/7) = (1 + 2/7)r(l + 2/7), the values of N T at m = 1 and m = 2 are the same. 

From ([141 . we see that iV T is inversely proportional to 6 2 ^ 7 . Therefore, when m is a positive integer, 
the ratio of the mean cardinalities of the transmitting set at two different threshold values is independent 
of m. This forms the main idea behind the estimation algorithm, and we surmise that this behavior holds 
at arbitrary m £ IR + . 

The algorithm based on the cardinality of the transmitting works at each node in the network as follows. 
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• For a known threshold value Q 1 > 1, set N Ti i(i) = 1 in time slot i, 1 < i < N, if the condition SIR 
> ©x holds, i.e., the node can correctly decode a packet and iV T1 (z) = otherwise. Compute the 
empirical mean cardinality averaged over several time slots, Nt,i = Sill 

• Likewise, evaluate 7VV,2 = 0-/N) Yli=i Nrpii) f° r another threshold value, 6 2 > 1. 

• Equate the mean cardinalities of the transmitting set for the two different threshold values to obtain 
N T ,i/N T ,2 = (0 2 /©i) 2/7 . Following this, 7 is estimated as 

= 2in(e a /e 1 ) 

' HNT,i/N T>2 y 

Thus, this algorithm requires the knowledge of neither A nor m. 

Since the performance of this algorithm depends on knowing whether packets are correctly decoded 
or not, it is advisable to keep time slots across different nodes synchronized so that the SIR at any node 
remains the same throughout a slot of time (or the packet transmission time). However, this assumption 
may be relaxed by making the duration of a time slot large relative to the packet transmission time, and 
setting Nr(i) = 1 even if at least one of the received packets is decoded successfully. 

Fig. [6] plots the empirical relative MSE of 7 for algorithm 3 versus the number of time slots N for 
various PLE values, while Fig. [7] shows the relative MSE of 7 versus m for iV = 10000. Unlike Algo. 
1 and 2, the relative MSE decreases with 7 in this case. Also, we observe that the MSE is low at lower 
values of m and increases with m. 

D. Discussion 

The problem of PLE estimation is fundamental and non-trivial. Each of the three algorithms we have 
described is fully distributed and can be performed at each node in the network. There is no need for 
coordination among nodes, and they do not require any information on the locations of nodes in the 
network or the Nakagami parameter m. Simulation results validate that the estimates are quite accurate 
over a large range of the system parameters 7 and m. Based on the relative MSE values, we conclude 
that at low values of 7, Algo. 1 performs the best (though it requires the network density to be known), 
while when 7 is high, Algo. 3 is preferred. If time slots across nodes are not synchronized, Algo. 2 is 
useful. Also, the convergence of the MSE is seen to occur within about 2000 time slots for all of the 
algorithms. For time slots of the order of milliseconds, it takes only a few seconds for the PLE to be 
estimated in practice. 



14 

Each of the estimation algorithms works by equating empirically measured values of certain network 
characteristics with their corresponding theoretical values. There is a caveat though, that needs to be 
addressed. The theoretical results used are for the "average network" (they are obtained by averaging over 
all possible realizations of the transmitter locations and channel states). However, in practice we have 
only a single realization of the node distribution at hand. Thus, even though the set of transmitters and the 
fading component of the channel change independently in different slots, the node locations remain the 
same and the interference at the nodes are spatio-temporally correlated [19]. This means that in general, 
the empirically computed values only approximate the theoretic results and the estimates are biased. The 
bias (and the MSE) can be significantly lowered if the nodes that record measurements have access to 
several independent realizations of the PPP and we use this idea later to improve the estimation accuracy 
(see Section |V]). The ALOHA MAC scheme turns out to be particularly helpful in this regard. 

The fact that nodes have access to just a single realization of the PPP also intuitively explains why the 
relative MSE decreases with m for Algorithms 2 and 3. Indeed, the variance of G is 1 + 1/m (obtained 
from ©), which increases with decreasing m. Considering the fading and link distance ambiguities jointly, 
a lower m corresponds to having greater randomness in the location of the nodes (upon taking the fading 
component to be a constant). Thus, this condition is equivalent to nodes being able to see several diverse 
realizations of the process over different time slots, and leads to a lower MSE. 

V. Improving the Estimation Accuracy 

As mentioned, the values measured at the nodes match the theoretical values more closely if each node 
has access to a larger number of realizations of the process. Fortunately, in the scenario where nodes are 
distributed as a homogeneous PPP, we can employ two simple ways based on this principle to improve 
the estimation accuracy. We describe them in this section and also provide simulation results on the MSE 
that validate the significant improvement in the performance of the estimation algorithms. 

A. Mobile nodes 

Assume that nodes are mobile and that in each time slot, they each move with a constant velocity 
v in a randomly chosen direction 0, that is uniformly distributed in [0, 2ir) (random waypoint mobility 
model). Since the nodes move independently, each node observes a different PPP realization (with the 
same density) in each different time slot. By recording measurements over different time slots, 7 can be 
estimated more accurately. Fig. [8] plots the relative MSE of 7 for each of the three methods when v — 0.1 
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m/slot. The dashed lines represent the relative MSE when mobility is not considered, and are also plotted 
to depict the improvement in performance. 

B. Coordinating Nodes 

Alternatively, if nodes in a neighborhood can coordinate and exchange information, then their mea- 
surements can be combined to yield a more accurate estimate of the PLE. Since the homogeneous PPP 
is ergodic, for any measurable function defined on $, its statistical average (obtained over different PPP 
realizations), and its spatial average (obtained over different nodes in a single realization) are equal almost 
everywhere in the limit [16, pp. 172]. Based on this result, the estimation process can be performed more 
accurately on a single realization of the network by collecting the recorded measurements over several 
nodes. 

Fig. [9] plots the relative MSE of the estimate versus the number of coordinating nodes K. From the 
figure, we see that the MSE sharply reduces with larger K . As mentioned earlier, with K — > oo, the 
relative MSE — »■ 0. To obtain the simulation results, we used the collective measurements recorded at 
the K nodes located closest to the network center over N = 2000 slots. The estimates are based on the 
averaged value of these measurements. 

VI. Sensitivity of the Algorithms 

We have formalized our algorithms based on the homogeneous PPP model and a spatially invariant 
path loss assumption. However, in reality, it is more likely that the nodal arrangement is not completely 
spatially random but takes on other forms such as being clustered or more regular. Also, the PLE value 
changes depending on the terrain type and the environmental conditions and hence cannot always be taken 
to be a constant over the entire network. In this section, we briefly comment on the sensitivity of our 
algorithms to these issues and illustrate how the PLE may be accurately estimated well even when some 
assumptions are relaxed. 

A. Spatial Invariance of the PLE 

In this subsection, we address the case where the PLE is not spatially invariant. For illustration purposes, 
we consider part of a network consisting of a square subregion A of side / centered at the origin with PLE 
7i, and an outer region B of PLE 72 as shown in Fig. [101 To model the path loss, we use the multi-slope 
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piecewise linear model [1]. Accordingly, for transmitter node n\ and receiver node n 2 , the path loss over 
a distance r\ + r 2 (see Fig. [10]) is (r 1 /r )~' yi • (1 + r 2 /r 1 ) -72 , for r\ > r , where r is the near-field radius. 

Under this setting, we study the error performance of Algorithm 3 for different locations along the x- 
axis. Fig. [TT] plots the relative MSE of 7 when I = 10 with 71 = 4 and 72 = 3 and shows that Algorithm 
3, by itself, works quite accurately even in a network with two different values of 7. The same qualitative 
behavior can be expected from Algorithms 1 and 2. 

For cases where the PLE varies more rapidly or when the network is sparse, nodes can coordinate 
to obtain better estimates. To accurately do this, it is helpful if nodes have a general idea of the PLE 
coherence length, i.e., the distance over which the PLE can be assumed to be invariant. This may vary 
from about a mile if the network terrain changes rapidly to as much as hundreds of miles if the network 
extends from an urban to a suburban to a rural area. It can be assumed that the network operator has a 
general idea about variations in the PLE and based on this, the network is divided into sub-areas with 
constant PLEs, each of which is estimated separately. For instance, if the PLE coherence length is d, 
each node can estimate the PLE based on measurements recorded by other nodes that lie inside a disk of 
radius d around it. 

B. Other Point Process Models 

Even though all the algorithms are formulated for the case of the homogeneous PPP, they may also be 
used to estimate the PLE in different spatial point process patterns. The idea is to artificially make the 
arrangement of nodes appear more "spatially random". This can be effectively performed upon simply 
employing randomized power control, wherein instead of having all the nodes transmit at unit power, we 
let nodes transmit at power levels drawn from a certain distribution. In fact, with independent thinning and 
appropriate rescaling, every point process is transformed into a stationary PPP (in the limit p — > 0) [21, 
Prop. 11.3.1]. A good choice for the distribution of transmit power levels is the exponential distribution 
since it is also the maximum non-negative entropy distribution [20], i.e., among all continuous pdfs 
supported on [0, 00) with a given mean, the exponential distribution has the maximum entropy. 

Upon employing power control, the algorithms designed for the PPP case may also be used to estimate 
the PLE for other point processes. Fig. [T2]plots the relative MSE values for Algorithm 3 for three non-PPP 
network models: the regular lattice grid, the Matern hard core process and the Thomas cluster process 
(for details on these point processes, please refer to [16]), and shows that the algorithm's performance is 
quite accurate when power control (based on the exponential distribution) is employed. Similar qualitative 
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behavior may be expected of Algorithms 2 and 3. Using power control improves the estimation accuracy 
even in the case that the nodal arrangement is a PPP since it helps realize diverse realizations of the PPP. 

VII. Summary 

In wireless systems, knowing the value of the PLE is critical, thus an accurate estimate is essential 
for their analysis and design. We offer a novel look at the issue of PLE estimation in a large wireless 
network, by taking into account Nakagami-m fading, the underlying node distribution and the network 
interference. We assume that nodes are arranged as a homogeneous PPP on the plane and the MAC 
scheme is slotted ALOHA (at least during the PLE estimation phase). For such a system, we present 
three distributed algorithms for PLE estimation, and provide simulation results to quantify the estimation 
errors. By incorporating mobility or coordination among nodes, the accuracy of the estimators can be 
greatly improved. The proposed algorithms perform well even when the PLE is spatially varying, and upon 
employing power control, in networks with more general node distributions. To the best of our knowledge, 
they are the first fully distributed PLE estimation algorithms in wireless networks with interference. This 
work is extensible to one- or three-dimensional networks in a straightforward manner. 
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Fig. 1. Relative MSE of 7 versus the number of time slots for different PLE values, for the estimation method based on the mean 
interference. The error is small, in particular when the PLE is small. 




Fig. 2. Relative MSE of 7 versus the number of time slots for the estimation method based on virtual outage probabilities. Alike Algo. 1, 
the estimation error increases with larger y. 



20 



X = 1 , p = 0.05, N = -25 dBm, 0=10 dB, 0=0 dB, N = 1 0000 




Nakagami parameter m 



100 



Fig. 3. Relative MSE of 7 versus the Nakagami parameter m for different PLE values, for the estimation method based on virtual outage 
probabilities. Note that this algorithm performs more accurately at lower values of m. 
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Fig. 4. Relative MSE of 7 versus N for Algorithm 2 for 7 = 3, 4 (dashed lines) and the same upon employing the distribution-fitting 
based technique (solid lines). The latter method is seen to drastically improve the estimation accuracy, in particular when 7 is large. 
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Fig. 6. Relative MSE of 7 versus the number of time slots for different PLE values, for the estimation method based on the mean cardinality 
of the transmitting set. In contrast to Algo. 1 and 2, the relative MSE decreases with increasing 7. 
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X= 1,p = 0.05, N = -25 dBm, © 1 = 10 dB, © 2 = dB, N = 10000 
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Fig. 7. Relative MSE of 7 versus the Nakagami parameter m for the estimation algorithm based on the mean cardinality of the transmitting 
set. The estimates are more accurate at lower m. 
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Fig. 8. Relative MSE of 7 versus the number of time slots, for the three estimation algorithms, with and without consideration of mobility. 
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Fig. 10. The considered Poisson network model with the square subregion A having a different value of PLE, 71 compared to the rest of 
the network B. The attenuation between nodes m and ni is modeled by a piecewise linear path loss model. 
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Fig. 11. Relative MSE 7 for algorithm 3 at different locations (x,0). The true values of the PLE are 71 = 4 for x < 5 and 72 = 3 for 

x > 5. 
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Fig. 12. Relative MSE of 7 versus the number of time slots, for three non-PPP models. In each case, Algorithm 3 is found to estimate 
accurately. 



